No-Regret Learnability for Piecewise Linear Losses

نویسندگان

  • Arthur Flajolet
  • Patrick Jaillet
چکیده

In the convex optimization approach to online regret minimization, many methodshave been developed to guarantee a O(√T ) bound on regret for subdifferentiableconvex loss functions with bounded subgradients, by using a reduction to linearloss functions. This suggests that linear loss functions tend to be the hardest onesto learn against, regardless of the underlying decision spaces. We investigate thisquestion in a systematic fashion looking at the interplay between the set of pos-sible moves for both the decision maker and the adversarial environment. Thisallows us to highlight sharp distinctive behaviors about the learnability of piece-wise linear loss functions. On the one hand, when the decision set of the decisionmaker is a polyhedron, we establish Ω(√T ) lower bounds on regret for a largeclass of piecewise linear loss functions with important applications in online lin-ear optimization, repeated zero-sum Stackelberg games, online prediction withside information, and online two-stage optimization. On the other hand, we ex-hibit O(log T ) learning rates, achieved by the Follow-The-Leader algorithm, inonline linear optimization when the boundary of the decision maker’s decision setis curved and when 0 does not lie in the convex hull of the environment’s decisionset. These results hold in a completely adversarial setting.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stability Conditions for Online Learnability

Stability is a general notion that quantifies the sensitivity of a learning algorithm’s output to small change in the training dataset (e.g. deletion or replacement of a single training sample). Such conditions have recently been shown to be more powerful to characterize learnability in the general learning setting under i.i.d. samples where uniform convergence is not necessary for learnability...

متن کامل

Regret bounds for Non Convex Quadratic Losses Online Learning over Reproducing Kernel Hilbert Spaces

We present several online algorithms with dimension-free regret bounds for general nonconvex quadratic losses by viewing them as functions in Reproducing Hilbert Kernel Spaces. In our work we adapt the Online Gradient Descent, Follow the Regularized Leader and the Conditional Gradient method meta algorithms for RKHS spaces and provide regret bounds in this setting. By analyzing them as algorith...

متن کامل

Agnostic Online Learning

We study learnability of hypotheses classes in agnostic online prediction models. The analogous question in the PAC learning model [Valiant, 1984] was addressed by Haussler [1992] and others, who showed that the VC dimension characterization of the sample complexity of learnability extends to the agnostic (or ”unrealizable”) setting. In his influential work, Littlestone [1988] described a combi...

متن کامل

Agnostic Online learnability

We study a fundamental question. What classes of hypotheses are learnable in the online learning model? The analogous question in the PAC learning model [12] was addressed by Vapnik and others [13], who showed that the VC dimension characterizes the learnability of a hypothesis class. In his influential work, Littlestone [9] studied the online learnability of hypothesis classes, but only in the...

متن کامل

Stochastic Convex Optimization

For supervised classification problems, it is well known that learnability is equivalent to uniform convergence of the empirical risks and thus to learnability by empirical minimization. Inspired by recent regret bounds for online convex optimization, we study stochastic convex optimization, and uncover a surprisingly different situation in the more general setting: although the stochastic conv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1411.5649  شماره 

صفحات  -

تاریخ انتشار 2014